Qy 14 93 CCACAGCCCAGGCCTCCAGCTCAGCAGTGACTGTGGCCATGGTCCCCAAGACCTCTATAT 1557 

I I I I I I I i I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I t I it I I I I I I ! i I I I I I I I I I 

Db 3 6621 CCACAGCCCAGGCCTCCAGCTCAGCAGTGACTGTGGCCATGGTCCCCAAGACCTCTATAT 3 65 62 

Qy 15 58 TTGCTCTTTTATTTTTATGTCTAAAATCCTGCTTAAAACTTTTCAATAAACAAGATCGTC 1617 

I I I I I I I I I I I I I I I I I i | I I I I I | | | I M ! | | | | | | | | | | | | | | | | | | | | | | | M | | | | 

Db 3 6561 TTGCTCTTTTATTTTTATGTCTAAAATCCTGCTTAAAACTTTTCAATAAACAAGATCGTC 36502 



Qy 1618 AGGACC 1623 
I I ! I If 

Db 36501 AGGACC 36496 



RESULT 3 

HSU32674 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 



FEATURES 

source 



intron 

gene 

CDS 



HSU32674 1293 bp DNA PRI 05-JUN-1996 

Human orphan receptor GPR9 { GPR9 ) gene, partial cds . 



U32674 
U32674 . 1 



GI :1002740 



Nguyen, T . 
Tsui, L. -C . 



Heng,H.H.Q., Saldivia, V . R. , 
Shi,X., George, S.R., 0'Dowd,B.F. 



BASE COUNT 
ORIGIN 



human . 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1293) 

Marchese,A., Heiber,M., Nguyen, T., Heng,H.H.Q., Saldivia, V . R . , 
Cheng, R. , Murphy, P.M., Tsui,L.-C, Shi,X., George, S.R., 0*Dowd,B.F. 
and Docherty , J . M . 

Cloning and chromosomal mapping of three novel genes, GPR9, GPR10, 
and GPR14, encoding receptors related to interleukin 8, 
neuropeptide Y, and somatostatin receptors 
Genomics 29 (2), 335-344 (1995) 
96115583 

2 (bases 1 to 1293) 
Marchese,A., Heiber,M., 
Cheng, R. , Murphy, P.M., 
and Docherty , J . M . 
Direct Submission 

Submitted ( 31- JUL-1995 ) B.F. O'Dowd, Department of Pharmacology, 
University of Toronto, 8 Taddle Creek Rd . , Toronto, Ontario M5S 
1A8, Canada 

Location /Qualifiers 

1. .1293 

/organism="Homo sapiens" 

/db_xref="taxon : 9606 M 

/chromosome="8" 

/map="8pll . 2-pl2" 

<1. .108 

/gene="GPR9" 

1. .1203 

/gene="GPR9" 

<109. .1203 

/gene="GPR9" 

/note="The initiating methionine has not been found; GPR9 

is an orphan receptor similar to the interleukin 8 

receptors " 

/codon_start=l 

/product="GPR9" 

/protein_id="AAC50505 . 1" 

/db_xref="GI: 1002741" 

/ translation="VSDHQVLNDAEVAALLENFSSSYDYGENESDSCCTSPPCPQDFS 
LNFDRAFLPALYSLLFLLGLLGNGAVAAVLLSRRTALSSTDTFLLHLAVADTLLVLTL 
PLWAVDAAVQWVFGSGLCKVAGALFNINFYAGALLLACISFDRYLNIVHATQLYRRGP 
PARVTLTCLAVWGLCLLFALPDFI FLSAHHDERLNATHCQYNFPQVGRTALRVLQLVA 
GFLLPLLVMAYCYAHILAVLLVSRGQRRLRAMRLVVVVVVAFALCWTPYHLVVLVDIL 
MDLGALARNCGRESRVDVAKSVTSGLGYMHCCLNPLLYAFVGVKFRERMWMLLLRLGC 
PNQRGLQRQPSSSRRDSSWSETSEASYSGL" 
201 a 458 c 357 g 277 t 



Query Match 71.1%; Score 1187.2; DB 66; Length 1293; 

Best Local Similarity 98.9%; Pred. No. 1.5e-184; 

Matches 1195; Conservative 0; Mismatches 13; Indels 0; Gaps 

Qy 58 CCAGCCCAGCCATGGTCCTTGAGGTGAGTGACCACCAAGTGCTAAATGACGCCGAGGTTG 117 

Ml I I I I I I I I II I I I I I I I I I I I I I I M I I I I I I I I I I I I f I I I 

Db 8 6 CCTCCCCGTTCCCGCCCTCACAGGTGAGTGACCACCAAGTGCTAAATGACGCCGAGGTTG 14 5 

Qy 118 CCGCCCTCCTGGAGAACTTCAGCTCTTCCTATGACTATGGAGAAAACGAGAGTGACTCGT 177 

I I M I I I I I I I I I II I I i I M I I I I II I I I II I I I I I I I I I I I I I I I ! | | ) | | | | | | | | | 
Db 146 CCGCCCTCCTGGAGAACTTCAGCTCTTCCTATGACTATGGAGAAAACGAGAGTGACTCGT 205 

Qy 178 GCTGTACCTCCCCGCCCTGCCCACAGGACTTCAGCCTGAACTTCGACCGGGCCTTCCTGC 237 

I I I I I I I I I I I I I I I I I i ! M I I I I I I I I I I I I I I I I I I I I I | | I | | | | | | | | | | | | | | | 
Db 206 GCTGTACCTCCCCGCCCTGCCCACAGGACTTCAGCCTGAACTTCGACCGGGCCTTCCTGC 2 65 

Qy 2 38 CAGCCCTCTACAGCCTCCTCTTTCTGCTGGGGCTGCTGGGCAACGGCGCGGTGGCAGCCG 2 97 

I I M I I I I I I I I II I I I I I I 1 M | I | | || | | | | | | | | | ) | | | f | | M I i M ! I I I I I M | 
Db 266 CAGCCCTCTACAGCCTCCTCTTTCTGCTGGGGCTGCTGGGCAACGGCGCGGTGGCAGCCG 325 

Qy 2 98 TGCTGCTGAGCCGGCGGACAGCCCTGAGCAGCACCGACACCTTCCTGCTCCACCTAGCTG 357 

I M I ! I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I | | | | 
Db 32 6 TGCTGCTGAGCCGGCGGACAGCCCTGAGCAGCACCGACACCTTCCTGCTCCACCTAGCTG 3 85 

Qy 358 TAGCAGACACGCTGCTGGTGCTGACACTGCCGCTCTGGGCAGTGGACGCTGCCGTCCAGT 417 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | M | | | | | | | | | | | | | | | | | | M | | 
Db 38 6 TAGCAGACACGCTGCTGGTGCTGACACTGCCGCTCTGGGCAGTGGACGCTGCCGTCCAGT 445 

Qy 418 GGGTCTTTGGCTCTGGCCTCTGCAAAGTGGCAGGTGCCCTCTTCAACATCAACTTCTACG 477 

I I I I I I I I I I I I I II I M I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 44 6 GGGTCTTTGGCTCTGGCCTCTGCAAAGTGGCAGGTGCCCTCTTCAACATCAACTTCTACG 505 

Qy 4 78 CAGGAGCCCTCCTGCTGGCCTGCATCAGCTTTGACCGCTACCTGAACATAGTTCATGCCA 537 

M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 50 6 CAGGAGCCCTCCTGCTGGCCTGCATCAGCTTTGACCGCTACCTGAACATAGTTCATGCCA 5 65 

Qy 538 CCCAGCTCTACCGCCGGGGGCCCCCGGCCCGCGTGACCCTCACCTGCCTGGCTGTCTGGG 597 

I I I I I ! I I I I I I I I I I I M I I II I I I I I I I I I I I I I I I I I I I I I I I M | | | | | M | | | | | 
Db 5 66 CCCAGCTCTACCGCCGGGGGCCCCCGGCCCGCGTGACCCTCACCTGCCTGGCTGTCTGGG 62 5 

Qy 598 GGCTCTGCCTGCTTTTCGCCCTCCCAGACTTCATCTTCCTGTCGGCCCACCACGACGAGC 657 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I II I I 
Db 62 6 GGCTCTGCCTGCTTTTCGCCCTCCCAGACTTCATCTTCCTGTCGGCCCACCACGACGAGC 68 5 

Qy 658 GCCTCAACGCCACCCACTGCCAATACAACTTCCCACAGGTGGGCCGCACGGCTCTGCGGG 717 

I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I 
Db 68 6 GCCTCAACGCCACCCACTGCCAATACAACTTCCCACAGGTGGGCCGCACGGCTCTGCGGG 74 5 

Qy 718 TGCTGCAGCTGGTGGCTGGCTTTCTGCTGCCCCTGCTGGTCATGGCCTACTGCTATGCCC 777 

I II I I I I I I II I I I I I I M I II I II I I I I I I I I I I I I I I I I I I I I I M I I I I I II I I I I | 
Db 74 6 TGCTGCAGCTGGTGGCTGGCTTTCTGCTGCCCCTGCTGGTCATGGCCTACTGCTATGCCC 8 05 

Qy 77 8 ACATCCTGGCCGTGCTGCTGGTTTCCAGGGGCCAGCGGCGCCTGCGGGCCATGCGGCTGG 8 37 

I I I I I I I I I ! I I I I I I I I I I II I I I I I I I I I I I I I I I II I I II I I I I M I I I I I I II I I I 

Db 80 6 ACATCCTGGCCGTGCTGCTGGTTTCCAGGGGCCAGCGGCGCCTGCGGGCCATGCGGCTGG 8 65 

Qy 838 TGGTGGTGGTCGTGGTGGCCTTTGCCCTCTGCTGGACCCCCTATCACCTGGTGGTGCTGG 8 97 

II I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 8 66 TGGTGGTGGTCGTGGTGGCCTTTGCCCTCTGCTGGACCCCCTATCACCTGGTGGTGCTGG 92 5 

Qy 8 98 TGGACATCCTCATGGACCTGGGCGCTTTGGCCCGCAACTGTGGCCGAGAAAGCAGGGTAG 957 

I I I I I I I I I I I I I I I I I I M I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 92 6 TGGACATCCTCATGGACCTGGGCGCTTTGGCCCGCAACTGTGGCCGAGAAAGCAGGGTAG 985 

Qy 958 ACGTGGCCAAGTCGGTCACCTCAGGCCTGGGCTACATGCACTGCTGCCTCAACCCGCTGC 1017 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I I I II II I 
Db 98 6 ACGTGGCCAAGTCGGTCACCTCAGGCCTGGGCTACATGCACTGCTGCCTCAACCCGCTGC 104 5 



Qy 1018 TCTATGCCTTTGTAGGGGTCAAGTTCCGGGAGCGGATGTGGATGCTGCTCTTGCGCCTGG 1077 



I M I M I I I I I II I I I I I I I l I I I M | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | || | 

Db 104 6 TCTATGCCTTTGTAGGGGTCAAGTTCCGGGAGCGGATGTGGATGCTGCTCTTGCGCCTGG 1105 

Qy 1078 GCTGCCCCAACCAGAGAGGGCTCCAGAGGCAGCCATCGTCTTCCCGCCGGGATTCATCCT 1137 

Mil I I I I [ I I I M | | | | ! 

Db 1106 GCTGCCCCAACCAGAGAGGGCTCCAGAGGCAGCCATCGTCTTCCCGCCGGGATTCATCCT 1165 

Qy 1138 GGTCTGAGACCTCAGAGGCCTCCTACTCGGGCTTGTGAGGCCGGAATCCGGGCTCCCCTT 1197 

I M I I ! I I I I I! I I I I I I I I 1 I II I I I I I I I I I I | | | | || | | | ) I ! I t I I I I I I I I I I I I 
Db 1166 GGTCTGAGACCTCAGAGGCCTCCTACTCGGGCTTGTGAGGCCGGAATCCGGGCTCCCCTT 122 5 

Qy 1198 TCGCCCACAGTCTGACTTCCCCGCATTCCAGGCTCCTCCCTCCCTCTGCCGGCTCTGGCT 1257 

I I f M I I I I I I I I I I I I I I I I I I i I I I I I I I I I II I I I I I I I I I I I I I I I I I ! I I I I I I I 
Db 1226 TCGCCCACAGTCTGACTTCCCCGCATTCCAGGCTCCTCCCTCCCTCTGCCGGCTCTGGCT 1285 

Qy 1258 CTCCCCAA 1265 

I I M I I I I 
Db 1286 CTCCCCAA 1293 



RESULT 1 




V26557 




t n 
1 u 


V26557 standard; DNA; 1670 BP. 


XX 








V26557; 




XX 






DT 


14-AUG-1998 


(first entry) 


XX 






UP., 


Human IP-10/Mig receptor CXCR3 gene. 


Y Y 
A A 






KW 


ds; chemokine receptor; cellular signal; treatment; T cell; 


KW 


inflammatory 


disease; antiturnour; antiviral. 


Y Y 
AA 






OS 


Homo sapiens. 




XX 






FH 


Key 


Location/Qualifiers 


FT 


CDS 


69. .1175 


FT 




/*tag= a 


TTT 

r i 




/product= "IP-10/Mig receptor CXCR3 " 


FT 


polyA_signal 


1534. .1539 


FT 




/ + tag= b 


FT 


polyA_si te 


1624 . . 1670 


FT 




/*tag= c 


XX 






PN 


W09811218-A1. 




XX 






PD 


19-MAR-1998 . 




XX 






PF 


lO-SEP-1997; 


97WO-US15915. 


XX 






PR 


31-MAR-1997; 


97US-0829839. 


PR 


10-SEP-1996; 


96US-0709838. 


XX 






PA 


(KOCH-) KOCHER INST THEODOR. 


PA 


(LEUK-) LEUKOSITE INC. 


XX 






PI 


Loetscher M, 


Mackay CR, Moser B, Qin S; 


XX 






DR 


WPI; 1998-207381/18. 


DR 


P-PSDB; W54371. 


XX 






PT 


DNA encoding 


CXC chemokine receptor 3 - inhibitors and promoters of 


PT 


which, are useful for treatment of inflammation or in anti-tumour or 


PT 


ant i -viral therapy 


XX 






PS 


Claim 6; Fig 


1; 137pp; English. 


XX 






cc 


The mammalian 


CXC chemokine receptor 3 (CXCR3) gene encodes a protein 


cc 


which can selectively bind one or more chemokines and can mediate 


cc 


cellular signalling and/or a cellular response in response. Inhibitors 



CC and promoters of mammalian CXCR3 can be detected and identified using 

CC host cells expressing CXCR3 . CXCR3 inhibitors can be used for treatment 

CC of inflammatory diseases which are T cell mediated. CXCR3 promoters are 

CC useful for antitumour or antiviral therapy. 

vv 

SQ Sequence 1670 BP; 272 A; 574 C; 436 G; 388 T; 0 other; 



Query Match 100.0%; Score 1670; DB 19; Length 1670; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1670; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


CCAACCACAAGCACCAAAGCAGAGGGGCAGGCAGCACACCACCCAGCAGCCAGAGCACCA 


60 






1 1 j I 1 I 1 1 1 1 1 M 1 1 1 | 1 1 1 ! 1 1 1 I 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 I I 1 I I I I I I i i i i i i i i i 

I I I I 1 1 1 1 1 1 1 I 1 1 1 1 M 1 1 i 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


I 


ccaaccacaagcaccaaagcagaggggcaggcagcacaccacccagcagccagagcacca 


60 


Qy 


61 


GCCCAGCCATGGTCCTTGAGGTGAGTGACCACCAAGTGCTAAATGACGCCGAGGTTGCCG 

1 1 | 1 I I 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 f II I 1 1 II 1 M 1 1 1 1 1 I I I I I i i 


120 


Db 


6 1 


1 1 f 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 I 1 1 1 1 1 1 1 1 1 ! II 1 1 1 1 1 1 1 1 1 1 I I I I I I I 
gcccagccatggtccttgaggtgagtgaccaccaagtgctaaatgacgccgaggttgccg 


120 


Qy 


121 


CCCTCCTGGAGAACTTCAGCTCTTCCTATGACTATGGAGAAAACGAGAGTGACTCGTGCT 


180 






1 j 1 1 1 1 1 1 M | M 1 M 1 | 1 1 | 1 | M 1 | 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 

I M II 1 II 1 1 1 II II 1 II II 1 1 1 1 II II 1 i 1 1 1 M i 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 i 1 1 1 M 1 1 




Db 


121 




180 


Qy 


181 


GTACCTCCCCGCCCTGCCCACAGGACTTCAGCCTGAACTTCGACCGGGCCTTCCTGCCAG 


240 






1 t 1 1 1 1 1 1 1 t 1 1 t 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 

1 I 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 ! 1 t 1 II 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 




Db 


1Q1 

10 1 


gtacctccccgccctgcccacaggacttcagcctgaacttcgaccgggccttcctgccag 


2 4 0 


Qy 


241 


CCCTCTACAGCCTCCTCTTTCTGCTGGGGCTGCTGGGCAACGGCGCGGTGGCAGCCGTGC 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2 41 


ccctctacagcctcctctt t c t gc t ggggc tgc t gggcaacggcgcgg t ggcagccg t gc 


300 


Qy 


301 


TGCTGAGCCGGCGGACAGCCCTGAGCAGCACCGACACCTTCCTGCTCCACCTAGCTGTAG 


360 






1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 i ) 1 1 1 1 1 1 1 1 1 1 j t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 11 1 I 1 




Db 


■3(11 


tgctgagccggcggacagccctgagcagcaccgacaccttcctgctccacctagctgtag 


3 60 


Qy 


361 


CAGACACGCTGCTGGTGCTGACACTGCCGCTCTGGGCAGTGGACGCTGCCGTCCAGTGGG 


420 






1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 \ 1 1 1 1 
1 1 1 1 II 1 1 I 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 






Jul 


cagacacgctgctggtgctgacactgccgctctgggcagtggacgctgccgtccagtggg 


4 20 


Qy 


421 


TCTTTGGCTCTGGCCTCTGCAAAGTGGCAGGTGCCCTCTTCAACATCAACTTCTACGCAG 


480 






\ 1 1 1 1 1 1 1 1 J 1 t J 1 1 t 1 1 1 1 1 1 i 1 1 f I I t 1 I I I I ( t \ i t i I 1 1 1 1 1 1 t ( 1 1 t l 1 t 1 * ( 1 1 

1 1 1 M II 1 1 1 II 1 1 1 1 1 If 1 1 1 1 1 II 1 1 It 1 1 1 1 ( II II 1 1 1 1 1 1 f 1 t ! 1 1 1 1 1 1 1 1 M 1 




UD 


AO 1 
H Z 1 


tctttggctctggcctctgcaaagtggcaggtgccctcttcaacatcaacttctacgcag 


4 80 


Qy 


481 


GAGCCCTCCTGCTGGCCTGCATCAGCTTTGACCGCTACCTGAACATAGTTCATGCCACCC 


540 






1 J 1 1 1 1 1 1 1 1 1 \ i 1 1 i 1 t 1 1 \ t 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 l l l l l l l l l l l l i 1 1 1 1 1 1 1 t 1 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




DO 


4 81 


gagccctcctgctggcctgcatcagctttgaccgctacctgaacatagttcatgccaccc 


54 0 


Qy 


541 


AGCTCTACCGCCGGGGGCCCCCGGCCCGCGTGACCCTCACCTGCCTGGCTGTCTGGGGGC 


600 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 




Db 


541 


agctctaccgccgggggcccccggcccgcgtgaccctcacctgcctggctgtctgggggc 


600 


Qy 


601 


TCTGCCTGCTTTTCGCCCTCCCAGACTTCATCTTCCTGTCGGCCCACCACGACGAGCGCC 


660 






M 1 1 1 i 1 1 1 1 1 1 1 I 1 II I I 1 1 I I I I I M I 1 1 1 1 1 1 1 1 1 1 1 i i ! I 1 t I 1 1 1 1 1 i 1 1 1 1 i 1 1 




Db 


601 


tctgcctgcttttcgccctcccagacttcatcttcctgtcggcccaccacgacgagcgcc 


660 


Qy 


661 


TCAACGCCACCCACTGCCAATACAACTTCCCACAGGTGGGCCGCACGGCTCTGCGGGTGC 


720 






1 1 1 1 II 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


tcaacgccacccactgccaatacaacttcccacaggtgggccgcacggctctgcgggtgc 


720 


Qy 


721 


TGCAGCTGGTGGCTGGCTTTCTGCTGCCCCTGCTGGTCATGGCCTACTGCTATGCCCACA 


780 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 ! II 1 1 




Db 


721 


tgcagctggtggctggctttctgctgcccctgctggtcatggcctactgctatgcccaca 


780 


Qy 


781 


TCCTGGCCGTGCTGCTGGTTTCCAGGGGCCAGCGGCGCCTGCGGGCCATGCGGCTGGTGG 


840 






1 M 1 1 1 1 1 1 1 M 1 1 1 1 i 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 




Db 


781 


tcctggccgtgctgctggtttccaggggccagcggcgcctgcgggccatgcggctggtgg 


840 


Qy 


841 


TGGTGGTCGTGGTGGCCTTTGCCCTCTGCTGGACCCCCTATCACCTGGTGGTGCTGGTGG 


900 


Db 


841 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 
tggtggtcgtggtggcctttgccctctgctggaccccctatcacctggtggtgctggtgg 


900 



RN [2] 

RP SEQUENCE OF 5-368 FROM N . A . 

RX MEDLINE; 96115583. 

RA Marchese A., Heiber M., Nguyen T., Heng H.H.Q., Saldivia V.R., 

RA Cheng R., Murphy P.M., Tsui L.-C, Shi X., Gregor P., George S.R., 

RA O'Dowd B.F., Docherty J.M.; 

RT "Cloning and chromosomal mapping of three novel genes, GPR9, GPR10, 
RT and GPR14, encoding receptors related to interleukin 8, neuropeptide 
RT Y, and somatostatin receptors."; 

RL Genomics 29:335-34 4(1995). 
RN [3] 

RP LIGAND BINDING. 

RC TI SSUE=FETAL ASTROCYTES; 

RX MEDLINE; 98290735. 

RA Cole K.E., Strick C.A., Paradis T.J., Ogborne K.T., Loetscher M . , 

RA Gladue R.P., Lin W., Boyd J.G., Moser B . , Wood D.E., Sahagan B.G., 

RA Neote K.; 

RT "Interferon-inducible T cell alpha chemoat t ractant (I-TAC) : A novel 

RT non-ELR CXC chemokine with potent activity on activated T cells 

RT through selective high affinity binding to CXCR3 . " ; 

RL J. Exp. Med. 187:2009-2021(1998). 

CC -!- FUNCTION: RECEPTOR FOR SCYB9/MIG, SCYB10/INP10 AND SCYB11/ITAC. 

CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN . 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G- PROTEIN COUPLED RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

CC 

DR EMBL; X95876; CAA65126.1; -. 

DR EMBL; U32674; AAC50505.1; -. 

DR GCRDB; GCR_1341; -. 

DR GCRDB; GCR_1972; -. 

DR MIM; 600894; 

DR INTERPRO; IPR000276; -. 

DR PFAM; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEPTOR; 1. 

KW G-protein coupled receptor; Transmembrane; Glycoprotein. 

FT DOMAIN 1 53 EXTRACELLULAR (POTENTIAL) . 

FT TRANSMEM 54 80 1 (POTENTIAL) . 

FT DOMAIN 81 8 9 CYTOPLASMIC (POTENTIAL) . 

FT TRANSMEM 90 110 2 (POTENTIAL). 

FT DOMAIN 111 125 EXTRACELLULAR (POTENTIAL) . 

FT TRANSMEM 126 147 3 (POTENTIAL) . 

FT DOMAIN 148 169 CYTOPLASMIC (POTENTIAL) . 

FT TRANSMEM 170 189 4 (POTENTIAL). 

FT DOMAIN 190 212 EXTRACELLULAR (POTENTIAL) . 

FT TRANSMEM 213 233 5 ( POTENTIAL ) . 

FT DOMAIN 234 255 CYTOPLASMIC (POTENTIAL). 

FT TRANSMEM 256 277 6 ( POTENTIAL) . 

FT DOMAIN 278 298 EXTRACELLULAR (POTENTIAL) . 

FT TRANSMEM 299 321 7 (POTENTIAL). * 

FT DOMAIN 322 368 CYTOPLASMIC (POTENTIAL). 

FT DISULFID 124 203 BY SIMILARITY. 

FT CARBOHYD 22 22 N-LINKED ( GLCNAC . . .) (POTENTIAL). 

FT CARBOHYD 32 32 N-LINKED (GLCNAC. . .) (POTENTIAL). 

SQ SEQUENCE 368 AA; 40659 MW; F08A3B44B2BBAD04 CRC64; 



/ IT UUrtJDiiDiDDnUUl V H , 



Query Match 100.0%; Score 1907; DB 1; Length 368; 

Best Local Similarity 100.0%; Pred. No. le-130; 

Matches 368; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MVLEVSDHQVLNDAEVAALLENFSSSYDYGENESDSCCTSPPCPQDFSLNFDRAFLPALY 60 

I ! I I I I M I I I [ I I I I I I I I I I I I I I I I I I I ( I | | | M | | | | | | | | | | | t I I ! I I I I I M 
Db 1 MVLEVSDHQVLNDAEVAALLENFSSS YDYGENESDSCCTSPPCPQDFSLNFDRAFLPALY 60 



Qy 61 SLLFLLGLLGNGAVAAVLLSRRTALSSTDTFLLHLAVADTLLVLTLPLWAVDAAVQWVFG 120 

I I I I I I 1 I I I I I I I I I I I I I t I I I I | I ! | | | | | | I | | | | | | 

Db 61 SLLFLLGLLGNGAVAAVLLSRRTALSSTDTFLLHLAVADTLLVLTLPLWAVDAAVQWVFG 120 

Qy 121 SGLCKVAGALFNINFYAGALLLACI SFDRYLNI VHATQLYRRGPPARVTLTCLAVWGLC L 130 

I I 1 I I M I I I I I I I I I I I I I I I II I I I I I I I I I | | | | | | | | | | | | | j | | | | | | j | 

Db 121 SGLCKVAGALFNINFYAGALLLACI S FDRYLN I VHATQLYRRGPPARVTLTCLAVWGLC L 180 

Qy 181 LFALPDFI FLSAHHDERLNATHCQYNFPQVGRTALRVLQLVAGFLLPLLVMAYCYAHI LA 240 

I I II I I I I I I I I I I I I I I I I I ( I I I I I! I I I I I I II I I I I I I | | | | | | | | | | | | | | | | | | 

181 LFALPDFI FLSAHHDERLNATHCQYNFPQVGRTALRVLQLVAGFLLPLLVMAYC YAHILA 24 0 



Db 



Qy 241 VLLVSRGQRRLRAMRLVVVVVVAFALCWTPYHLVVLVDILMDLGALARNCGRESRVDVAK 300 

I I I I M I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I II I I I I I I I I | | | | | | | M | | 
Db 241 VLLVSRGQRRLRAMRLVVVVVVAFALCWTPYHLVVLVDILMDLGALARNCGRESRVDVAK 300 

Qy 301 SVTSGLGYMHCCLNPLLYAFVGVKFRERMWMLLLRLGCPNQRGLQRQPSSSRRDSSWSET 360 

I I M I I I I I I I I I I I II I I I I I I I It I I I I I I I I I I I I f I I I t II I I I I I I I I I I | | | | | 
Db 301 SVTSGLGYMHCCLNPLLYAFVGVKFRERMWMLLLRLGCPNQRGLQRQPSSSRRDSSWSET 360 

Qy 361 SEAS YSGL 368 

I I I I I I I f 
Db 361 SEASYSGL 368 



RESULT 4 
CCR5_HUMAN 

ID CCR5_HUMAN STANDARD; PRT; 312 AA . 

AC P32302; Q14811; 

DT 01-OCT-1993 (Rel. 27, Created) 

DT 01-OCT-1993 (Rel. 27, Last sequence update) 

DT 30-MAY-2000 (Rel. 39, Last annotation update) 

DE C-X-C CHEMOKINE RECEPTOR TYPE 5 ( CXC-R5 ) (CXCR-5) ( BURKITT ' S LYMPHOMA 

DE RECEPTOR 1) (MONOCYTE- DERIVED RECEPTOR 15) (MDRl 5 ) . 

GN BLR1 OR CXCR5. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
RN [1] 

RP SEQUENCE FROM N.A. (LONG FORM) . 

RC TISSUE=LYMPHOCYTES; 

RX MEDLINE; 93049615. 

RA Dobner T . , Wolf I., Emrich T., Lipp M.; 

RT "Differentiation-specific expression of a novel G protein-coupled 

RT receptor from Burkitt's lymphoma."; 

RL Eur. J. Immunol. 22:2795-2799(1992). 

RN [2] 

RP SEQUENCE FROM N.A. (SHORT FORM). 

RC TISSUE=BLOOD; 

RX MEDLINE; 95366951. 

RA Barella L . , Loetscher M . , Tobler A . , Baggiolini M., Moser B.; 

RT "Sequence variation of a novel heptahelical leucocyte receptor 

RT through alternative transcript formation."; 

RL Biochem. J . 309:773-77 9(1995}. 

RN [3] 

RP LIGAND BINDING. 

RX MEDLINE; 98130629. 

RA Legler D.F., Loetscher M., Stuber Roos R., Clark-Lewis I., 

RA Baggiolini M., Moser B.; 

RT "B cell-attracting chemokine 1, a human CXC chemokine expressed in 

RT lymphoid tissues, selectively attracts B lymphocytes via 

RT BLR1/CXCR5. "; 

RL J. Exp. Med. 187:655-660(1998). 

CC -!- FUNCTION: CYTOKINE RECEPTOR THAT BINDS TO BLC . BLR1 EXERTS 

CC POSSIBLY A REGULATORY FUNCTION IN BURKITT LYMPHOMA (BL) 

CC LYMPHOMAGENESIS AND/ OR B-CELL DIFFERENTIATION. IT IS A POTENTIAL 

CC CANDIDATE FOR CELL-CELL INTERACTION, AND ACTIVATION OF MATURE B- 

CC LYMPHOCYTES IN LYMPHATIC TISSUES. 

CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN . 



cc 


-!- ALTERNATIVE PRODUCTS: TWO FORMS OF THE RECEPTOR ARE PRODUCED BY 




cc 


ALTERNATIVE SPLICING. 






cc 


-!- TISSUE SPECIFICITY: EXPRESSION IN MATURE B-CELLS AND BURKITT 




cc 


LYMPHOMA CELLS. 






cc 
cc 
cc 


-!- SIMILARITY: BELONGS TO 


FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 




This SWISS-PROT entry is copyright. It is produced through a collaboration 


cc 


be t ween 


the Swiss Institute of Bioin f ormatics and the EMBL outstation - 


cc 


the European Bioinf ormatics 


Institute. There are no restrictions on 


its 


cc 


use by 


non-profit institutions as long as its content is in no 


way 


cc 


modi f ied 


and this statement 


is not removed. Usage by and for commercial 


cc 


entities 


requires a license 


agreement (See http://www.isb-sib.ch/announce/ 


cc 
cc 

DR 


or send an email to license@isb-sib . ch ) . 




EMBL; X68149; CAA48252.1; - 






DR 


EMBL; X68 


829; CAA48723.1; - 






DR 


PIR; S26667; S26667. 






DR 


GCRDB; GCR 0453; -. 






DR 


GCRDB; GCR 2072; -. 






DR 


GCRDB; GCR 2612; -. 






DR 


MIM; 601613; -. 






DR 


INTERPRO; 


IPR000276; - . 






DR 


INTERPRO; 


IPR001053; 






DR 


PFAM; PF00001; 7tm 1; 1. 






DR 


PRINTS; PR00237; GPCRRHODOPSN . 




DR 


PRINTS; PR00564; BURKITTSLYMR. 




DR 


PROSITE; 


PS00237; G_PROTEIN 


_RECEPTOR; 1 . 




KW 


G-protein 


coupled receptor; 


Transmembrane; Glycoprotein; B-cell; 




KW 


Alternative splicing. 






FT 


DOMAIN 


1 55 


EXTRACELLULAR ( POTENTIAL ) . 




FT 


TRANSMEM 


56 76 


1 ( POTENTIAL ) . 




FT 


DOMAIN 


77 88 


CYTOPLASMIC (POTENTIAL) . 




FT 


TRANSMEM 


89 109 


2 (POTENTIAL) . 




FT 


DOMAIN 


110 124 


EXTRACELLULAR (POTENTIAL) . 




FT 


TRANSMEM 


125 145 


3 (POTENTIAL) . 




FT 


DOMAIN 


146 167 


CYTOPLASMIC (POTENTIAL) . 




FT 


TRANSMEM 


168 188 


4 (POTENTIAL) . 




FT 


DOMAIN 


189 219 


EXTRACELLULAR (POTENTIAL) . 




FT 


TRANSMEM 


220 240 


5 (POTENTIAL) . 




FT 


DOMAIN 


241 259 


CYTOPLASMIC (POTENTIAL) . 




FT 


TRANSMEM 


260 280 


6 (POTENTIAL) . 




FT 


DOMAIN 


281 304 


EXTRACELLULAR (POTENTIAL) . 




FT 


TRANSMEM 


305 325 


7 (POTENTIAL) . 




FT 


DOMAIN 


326 372 


CYTOPLASMIC (POTENTIAL) . 




FT 


CARBOH YD 


28 28 


N-LINKED (GLCNAC. . .) (POTENTIAL). 




FT 


CARBOHYD 


196 196 


N-LINKED (GLCNAC. . .) (POTENTIAL). 




FT 


DISULFID 


122 202 


POTENTIAL . 




FT 


VARSPLIC 


1 45 


MISSING (IN SHORT ISOFORM) . 




FT 


CONFLICT 


344 344 


G -> S (IN REF. 2) . 




SQ 


SEQUENCE 


372 AA; 41955 MW; 6DF84C8394 92ACCF CRC64 ; 





Query Match 33.3%; Score 635; DB 1; Length 372; 

Best Local Similarity 40.8%; Pred. No. 4.2e-39; 

Matches 147; Conservative 50; Mismatches 145; Indels 18; Gaps 

JFSSSY DYGENESD-SCCTSPPCPQ DFSLNFDRAFLPALYSLLFLLGLLGNG 72 

: I = I = I I = II : I I : I I I I : I I I I : : I I 



AVAAVLLSRRTALSSTDTFLLHLAVADTLLVLTLPLWAVDAAVQWVFGSGLCKVAGALFN 132 

I : I I 111:111111111111 II : : I I I I : I I I II 

LVLVILERHRQTRSSTETFLFHLAVADLLLVFILPFAVAEGSVGWVLGTFLCKTVI ALHK 130 

INFYAGALLLACI SFDRYLNI VHATQLYRRGPPARVTLTCLAVWGLCLLFALPDFI FL-- 190 
: I I i : I I I I I I : I I i I I I I I I I : : I | : | : III:: 



-SAHHDERLNATHCQYNFPQVGRT ALRVLQLVAGFLLPLLVMAYCYAHILAVL-LV 24 4 

II: I I : : I II I I I I I I I : I II : I I : : I 

SQGHHNNSL--PRCTFSQENQAETHAWFTSRFLYHVAGFLLPMLVMGWCYVGWHRLRQA 24 8 



Qy 


20 


Db 


11 


Qy 


73 


Db 


71 


Qy 


133 


Db 


131 


Qy 


191 


Db 


191 



Qy 245 SRGQRRLRAMRLVVVVVVAFALCWTPYHLVVLVDILMDLGALARNCGRESRVDVAKSVTS 304 

i : I : I : I : : : I I I I I : I I I : I : : | | | |: | : I | : : 

Db 24 9 QRRPQRQKAVRVAI LVTS I FFLCWS PYH I VI FLDTLARLKAVDNTCKLNGSLPVAI TMCE 308 

Qy 305 GLGYMHCCLNPLLYAFVGVKFRERMWMLLLRLGCPNQRGLQRQPSSSRRDSSWSETSEAS 364 

II N I I if : I I I I I I | | : I I : | I | : I I I I I i | : : 

Db 309 FLGLAHCCLNPMLYTFAGVKFRSDLSRLLTKLGCTGPASLCQLFPSWRR-SSLSESENAT 367 



RESULT 1 
015185 

ID 015185 PRELIMINARY; PRT; 415 AA . 

AC 015185; 

DT OWAN-1998 (TrEMBLrel. 05, Created) 

DT 0WAN-1998 (TrEMBLrel. 05, Last sequence update) 

DT 01-JUN-2000 (TrEMBLrel. 14, Last annotation update) 

DE G PROTEIN- COUPLED RECEPTOR CKR-L2 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Gutierrez J . , Varona R., Zaballos A., Lind P., Marquez G.; 

RL Submitted (SEP-1996) to the EMBL/GenBank /DDB J databases. 

DR EMBL; Z79783; CAB02143.1; 

DR INTERPRO; IPR000276; -. 

DR PFAM; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROT EI NONRECEPTOR; UNKNOWN_l . 

SQ SEQUENCE 415 AA; 45608 MW; DE4 7 4 B208 4 BC2 99B CRC64 ; 



Query Match 98.9%; Score 1886; DB 4; Length 415; 

Best Local Similarity 99.5%; Pred. No. 3.2e-151; 

Matches 363; Conservative 1; Mismatches 1; Indels 0; Gaps 

Qy 4 EVSDHQVLNDAEVAALLENFSSSYDYGENESDSCCTSPPCPQDFSLNFDRAFLPALYSLL 63 

: I I I I I ! I I II I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I II I I I I I t I I I I I I I I 
Db 51 QVSDHQVLNDAEVAALLENFSSSYDYGENESDSCCTSPPCPQDFSLNFDRAFLPALYSLL 110 

Qy 64 FLLGLLGNGAVAAVLLSRRTALSSTDTFLLHLAVADTLLVLTLPLWAVDAAVQWVFGSGL 123 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I II I I I I I I M I I I I I I I I I I M I I I I I I 
Db 111 FLLGLLGNGAVRAVLLSRRTALSSTDTFLLHLAVADTLLVLTLPLWAVDAAVQWVFGSGL 17 0 

Qy 124 CKVAGALFNINFYAGALLLACISFDRYLNIVHATQLYRRGPPARVTLTCLAVWGLCLLFA 183 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I | | | | | 
Db 171 CKVAGALFNINFYAGALLLACISFDRYLNIVHATQLYRRGPPARVTLTCLAVWGLCLLFA 230 

Qy 184 LPDFI FLSAHHDERLNATHCQYNFPQVGRTALRVLQLVAGFLLPLLVMAYCYAHILAVLL 243 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I II I I I I II I I I I I I II I I I I I I I I I I 
Db 231 LPDFI FLSAHHDERLNATHCQYNFPQVGRTALRVLQLVAGFLLPLLVMAYCYAHILAVLL 2 90 

Qy 24 4 VSRGQRRLRAMRLVVVVVVAFALCWTPYHLVVLVDILMDLGALARNCGRESRVDVAKSVT 303 

I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I II I I I M I I I I I I I I | | | || | | | | | | | 
Db 291 VSRGQRRLRAMRLVVVVVVAFALCWTPYHLVVLVDILMDLGALARNCGRESRVDVAKSVT 350 

Qy 304 SGLGYMHCCLNPLLYAFVGVKFRERMWMLLLRLGCPNQRGLQRQPSSSRRDSSWSETSEA 363 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 351 SGLGYMHCCLNPLLYAFVGVKFRERMWMLLLRLGCPNQRGLQRQPSSSRRDSSWSETSEA 410 

Qy 364 SYSGL 368 
I I I I I 

Db 411 SYSGL 415 



Qy 


901 


ACATCCTCATGGACCTGGGCGCTTTGGCCCGCAACTGTGGCCGAGAAAGCAGGGTAGACG 


960 






H 1 1 M 1 1 1 1 1 1 [ 1 I I ! M 1 1 1 1! 1 I ! I | | | I I 1 1 1 1 ! 1 1 1 1 1 1 1 M 1 1 1 I I 1 1 1 1 1 1 1 1 

' p 1 i i i i i 1 ' 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 ( 1 1 1 1 1 1 1 1 




Db 


901 


acatcc tcatggacctgggcgctttggcccgcaactgtggccgagaaagcagggtagacg 


960 


Qy 


961 


TGGCCAAGTCGGTCACCTCAGGCCTGGGCTACATGCACTGCTGCCTCAACCCGCTGCTCT 


1020 






1 M 1 1 1 1 1 1 1 1 1 1 I 1 1 1 | | | | | | | | | | | ! | | | | | | | | | | | j | | | | | | | j | | | | | | j | | | | 




Db 


961 


tggccaagtcggtcacctcaggcctgggctacatgcactgctgcctcaacccgctgctct 


1020 


Qy 


1021 


ATGCCTTTGTAGGGGTCAAGTTCCGGGAGCGGATGTGGATGCTGCTCTTGCGCCTGGGCT 


1080 






1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 i 1 1 1 1 1 I 1 1 1 1 1 1 1 | j| 1 1 1 




Db 


1021 


atgcctttgtaggggtcaagttccgggagcggatgtggatgctgctcttgcgcctgggct 


1080 


Qy 


1081 


GCCCCAACCAGAGAGGGCTCCAGAGGCAGCCATCGTCTTCCCGCCGGGATTCATCCTGGT 


1140 






1 1 1 I 1 ! 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 ! 1 1 1 i 1 i | | 




Db 


1081 


gccccaaccagagagggctccagaggcagccatcgtcttcccgccgggattcatcctggt 


1140 


Qy 


1141 


CTGAGACCTCAGAGGCCTCCTACTCGGGCTTGTGAGGCCGGAATCCGGGCTCCCCTTTCG 


1200 






M 1 M 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 ! 1 ! 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 M I 1 1 II 1 I 1 1 




Db 


1141 


ctgagacctcagaggcctcctactcgggcttgtgaggccggaatccgggctcccctttcg 


1200 


Qy 


1201 


CCCACAGTCTGACTTCCCCGCATTCCAGGCTCCTCCCTCCCTCTGCCGGCTCTGGCTCTC 


1260 






1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I j I | | | 




Db 


1201 


cccacagtctgacttccccgcattccaggctcctccctccctctgccggctctggctctc 


1260 


Qy 


1261 


CCCAATATCCTCGCTCCCGGGACTCACTGGCAGCCCCAGCACCACCAGGTCTCCCGGGAA 


1320 






1 i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 i 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 ! 1 1 M 1 1 1 




Db 


1261 


cccaatatcctcgctcccgggactcactggcagccccagcaccaccaggtctcccgggaa 


1320 


Qy 


1321 


GCCACCCTCCCAGCTCTGAGGACTGCACCATTGCTGCTCCTTAGCTGCCAAGCCCCATCC 


1380 






1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 




Db 


1321 


gccaccctcccagctctgaggactgcaccattgctgctccttagctgccaagccccatcc 


1380 


Qy 


1381 


TGCCGCCCGAGGTGGCTGCCTGGAGCCCCACTGCCCTTCTCATTTGGAAACTAAAACTTC 


1440 






1 I 1 M 1 1 M 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 M 1 1 1 i 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 | | | | 




Db 


1381 


tgccgcccgaggtggctgcctggagccccactgcccttctcatttggaaactaaaacttc 


1440 


Qy 


1441 


ATCTTCCCCAAGTGCGGGGAGTACAAGGCATGGCGTAGAGGGTGCTGCCCCATGAAGCCA 


1500 






1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 I 1 | 




Db 


1441 


atcttccccaagtgcggggagtacaaggcatggcgtagagggtgctgccccatgaagcca 


1500 


Qy 


1501 


CAGCCCAGGCCTCCAGCTCAGCAGTGACTGTGGCCATGGTCCCCAAGACCTCTATATTTG 


1560 






1 1 1! 1 1 1 1 1 1 1 1 !! 1 1 1 1 1 1 1 M 1 1 1 1 I II 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 ! 1 1 II 1 1 1 I 1 1 1 1 




Db 


1501 


cagcccaggcctccagctcagcagtgactgtggccatggtccccaagacctctatatttg 


1560 


Qy 


1561 


CTCTTTTATTTTTATGTCTAAAATCCTGCTTAAAACTTTTCAATAAACAAGATCGTCAGG 


1620 






M 1 1 1 1 1 1 I I I I 1 1 I I 1 I | | | I | | | | M 1 1 1 1 1 1 1 1 1 1 1 I I I I I I 1 ! I I | I I | | | | | | M 




Db 


1561 


ctcttttatttttatgtctaaaatcctgcttaaaacttttcaataaacaagatcgtcagg 


1620 


Qy 


1621 


ACCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT 1670 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I I I 1 M 1 1 1 




Db 


1621 


accttttttttttttttttttttttttttttttttttttttttttttttt 1670 





RESULT 2 
T72800 

ID T72800 standard; cDNA; 1876 BP. 
XX 

AC T7 2800; 

XX 

DT 28-SEP-1997 (first entry) 
XX 

DE Human G-protein chemokine receptor HSATU68 cDNA. 

XX 

KW HSATU68 ; G-protein chemokine receptor; 7- transmembrane receptor; 
KW signal transduction; gene therapy; diagnosis; ss. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 
FT CDS 173.. 1420 



FT / + tag= a 

FT primer_bind complement (173.. 190) 

FT /*tag= b 

FT primer_bind 1402. .1420 

FT /*tag= c 

XX 

PN WO9725340-A1 . 
XX 

PD 17-JUL-1997. 

XX 

PF ll-JAN-1996; 96WO-US004 99 . 

XX 

PR ll-JAN-1996; 9 6WO-US004 99 . 

XX 

PA (HUMA-) HUMAN GENOME SCI INC. 

PI Li Y; 
XX 

DR WPI; 1997-372810/34. 

DR P-PSDB; W19780. 

XX 

PT DNA encoding new isolated human G-protein chemokine receptor - used 

PT to develop products for treating, e.g. autoimmune diseases, chronic 

PT infections, allergy, malignancy, inflammation or shock 

XX 

PS Claim 1; Fig la-d; 54pp; English. 

XX 

CC A cDNA clone (T72800) codes for a newly identified human G-protein 

CC chemokine receptor (W19780) designated HSATU68 . It was identified 

CC in a human genomic library derived from activated T cells. HSATU68 

CC nucleic acids be used to produce HSATU68 in prokaryotic or 

CC eukaryotic host cells, as probes for the detection, screening and 

CC diagnosis of diseases caused by mutations in the HSATU68 gene, and 

CC in the gene therapy of conditions related to underexpression of the 

CC receptor its ligand. 
XX 

SQ Sequence 1876 BP; 343 A; 625 C; 520 G; 388 T; 0 other; 



Query Match 92.2%; Score 1540; DB 18; Length 1876; 



Best Local Similarity 99.0%; Pred. No. 6.2e-287; 




Matches 


1549; Conservative 0; Mismatches 15; Indels 0; Gaps 


Qy 


58 


CCAGCCCAGCCATGGTCCTTGAGGTGAGTGACCACCAAGTGCTAAATGACGCCGAGGTTG 


117 






II III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 i 1 1 1 1 ! 1 1 1 




Db 


303 


cctccccgttcccgccctcacaggtgagtgaccaccaagtgctaaatgacgccgaggttg 


362 


Qy 


118 


CCGCCCTCCTGGAGAACTTCAGCTCTTCCTATGACTATGGAGAAAACGAGAGTGACTCGT 


177 






N 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II I II 1 1 1 1 1 1 1 1 1 1 1 1 1 I | | | | | | | | | | M | | | | | | | | ! | M 




Db 


363 


ccgccctcctggagaacttcagctcttcctatgactatggagaaaacgagagtgactcgt 


422 


Qy 


178 


GCTGTACCTCCCCGCCCTGCCCACAGGACTTCAGCCTGAACTTCGACCGGGCCTTCCTGC 


237 






1 ! 1 1 1 1 M i 1 1 1 1 1 1 1 1 M i i 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 II 




Db 


423 


gctgtacctccccgccctgcccacaggacttcagcctgaacttcgaccgggccttcctgc 


482 


Qy 


238 


CAGCCCTCTACAGCCTCCTCTTTCTGCTGGGGCTGCTGGGCAACGGCGCGGTGGCAGCCG 


297 






MINIM 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 M II 1 1 1 1 II 1 II M M M 1 1 1 II II 1 1 II M 1 




Db 


483 


cagccctcaacagcctcctctttctgctggggctgctgggcaacggcgcggtggcagccg 


542 


Qy 


298 


TGCTGCTGAGCCGGCGGACAGCCCTGAGCAGCACCGACACCTTCCTGCTCCACCTAGCTG 


357 






M 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 II II M 1 1 1 M M 1 1 II 1 1 1 1 1 II M 1 




Db 


543 


tgctgctgagccggcggacagccctgagcagcaccgacaccttcctgctccacctagctg 


602 


Qy 


358 


TAGCAGACACGCTGCTGGTGCTGACACTGCCGCTCTGGGCAGTGGACGCTGCCGTCCAGT 


417 






1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 II 1 II M 1 1 1 1 II M 1 




Db 


603 


tagcagacacgctgctggtgctgacactgccgctctgggcagtggacgctgccgtccagt 


662 


Qy 


418 


GGGTCTTTGGCTCTGGCCTCTGCAAAGTGGCAGGTGCCCTCTTCAACATCAACTTCTACG 


477 






1 M M 1 1 1 1 1 1 1 1 1 1 1 1 M M II II 1 1 1 1 1 1 M 1 1 1 II 1 1 II 1 1 1 M II 1 1 II 1 1 1 II II 




Db 


663 


gggtctttggctctggcctctgcaaagtggcaggtgccctcttcaacatcaacttctacg 


722 



4 1 8 CAGGAGCCCTCCTGCTGGCCTGCATCAGCTTTGACCGCTACCTGAACATAGTTCATGCCA 5 3 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill 

72 3 caggagccctcctgctggcctgcatcagctttgaccgctacctgaacatagttcatgcca 7 82 

538 CCCAGCTCTACCGCCGGGGGCCCCCGGCCCGCGTGACCCTCACCTGCCTGGCTGTCTGGG 597 
I I M I I I I I I I I I I I I I I I | | | | | | | || | , 

783 cccagctctaccgccgggggcccccggcccgcgtgaccctcacctgcctggctgtctggg 842 

5 98 GGCTCTGCCTGCTTTTCGCCCTCCCAGACTTCATCTTCCTGTCGGCCCACCACGACGAGC 657 

I I I M I I 1 I I I I I I I I I I I I | | i { | | | | | | | M M | M | | | | | | | | j | | j | | | | | | j | | | 
84 3 ggctctgcctgcttttcgccctcccagacttcatcttcctgtcggcccaccacgacgagc 902 

658 GCCTCAACGCCACCCACTGCCAATACAACTTCCCACAGGTGGGCCGCACGGCTCTGCGGG 717 

I I M M I I I I I I I I M I I I I I I II I I I I I I I I II II 1 | | | | | M | | | | | || | | M I I I I I 
90 3 gcctcaacgccacccactgccaatacaacttcccacaggtgggccgcacggctctgcggg 9 62 

718 TGCTGCAGCTGGTGGCTGGCTTTCTGCTGCCCCTGCTGGTCATGGCCTACTGCTATGCCC 777 

I I M I I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I | I j | | 
9 63 tgctgcagctggtggctggctttctgctgcccctgctggtcatggcctactgctatgccc 1022 

778 ACATCCTGGCCGTGCTGCTGGTTTCCAGGGGCCAGCGGCGCCTGCGGGCCATGCGGCTGG 8 37 
I N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I | | 
102 3 acatcctggccgtgctgctggtttccaggggccagcggcgcctgcgggccatgcggctgg 1082 

8 38 TGGTGGTGGTCGTGGTGGCCTTTGCCCTCTGCTGGACCCCCTATCACCTGGTGGTGCTGG 8 97 
I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I | i | 
108 3 tggtggtggtcgtggtggcctttgccctctgctggaccccctatcacctggtggtgctgg 1142 

8 98 TGGACATCCTCATGGACCTGGGCGCTTTGGCCCGCAACTGTGGCCGAGAAAGCAGGGTAG 957 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I || | | || I 
1143 tggacatcctcatggacctgggcgctttggcccgcaactgtggccgagaaagcagggtag 12 02 

958 ACGTGGCCAAGTCGGTCACCTCAGGCCTGGGCTACATGCACTGCTGCCTCAACCCGCTGC 1017 
I I I I I I I I I M I I I I I I I I I I I I I I I (I I I I I I I I I I I I I I II I I I I I I I I I || | | | | | | 
1203 acgtggccaagtcggtcacctcaggcctgggctacatgcactgctgcctcaacccgctgc 12 62 

1018 TCTATGCCTTTGTAGGGGTCAAGTTCCGGGAGCGGATGTGGATGCTGCTCTTGCGCCTGG 1077 

I M ! I I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I 
12 63 tctatgcctttgtaggggtcaagttccgggagcggatgtggatgctgctcttgcgcctgg 1322 

107 8 GCTGCCCCAACCAGAGAGGGCTCCAGAGGCAGCCATCGTCTTCCCGCCGGGATTCATCCT 1137 

I I I M I M I I I I II II I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | ! | | | 
1323 gctgccccaaccagagagggctccagaggcagccatcgtcttcccgccgggattcatcct 13 82 

1138 GGTCTGAGACCTCAGAGGCCTCCTACTCGGGCTTGTGAGGCCGGAATCCGGGCTCCCCTT 1197 

I I I I I I I M I I I I I I I I II I I I II I I I I I I I I I I I I I I I i M i I II I I I I I I I I I I I I I I 
1383 ggtctgagacctcagaggcctcctactcgggcttgtgaggccggaatccgggctcccctt 14 42 

1198 TCGCCCACAGTCTGACTTCCCCGCATTCCAGGCTCCTCCCTCCCTCTGCCGGCTCTGGCT 12 57 

M I I I I I I I M I I I I I I I I I II I I I I ! I M I I I I I I I I I I I I I I I I I I I I I i I | | | | | | | 
14 4 3 tcgcccacagtctgacttccccgcattccaggctcctccctccctctgccggctctggct 1502 

12 58 CTCCCCAATATCCTCGCTCCCGGGACTCACTGGCAGCCCCAGCACCACCAGGTCTCCCGG 1317 

I 1 I I I I I M I M I I I I I I II I I I I I I I I II I I I I I I I I I I I I I M II I II I I I I I M I I I 
1503 ctccccaatatcctcgctcccgggactcactggcagccccagcaccaccaggtctcccgg 15 62 

1318 GAAGCCACCCTCCCAGCTCTGAGGACTGCACCATTGCTGCTCCTTAGCTGCCAAGCCCCA 1377 

I I I I I I I I I II I I I I II I I I I I I I I M I I I I I I I M I I I 1 I | | | | | | | | If | | | | | | | | | 
1563 gaagccaccctcccagctctgaggactgcaccattgctgctccttagctgccaagcccca 1 622 

1378 TCCTGCCGCCCGAGGTGGCTGCCTGGAGCCCCACTGCCCTTCTCATTTGGAAACTAAAAC 14 37 

I I I I I I I I I I II I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
162 3 tcctgccgcccgaggtggctgcctggagccccactgcccttctcatttggaaactaaaac 1682 

14 38 TTCATCTTCCCCAAGTGCGGGGAGTACAAGGCATGGCGTAGAGGGTGCTGCCCCATGAAG 14 97 

M I I I I I I I I I II I I I I M I I I I I I I I ! I I 1 I I | I | | | | | || | | | | | | | | | M | | | | | | | 
168 3 ttcatcttccccaagtgcggggagtacaaggcatggcgtagagggtgctgccccatgaag 17 42 

14 98 CCACAGCCCAGGCCTCCAGCTCAGCAGTGACTGTGGCCATGGTCCCCAAGACCTCTATAT 1557 

I I I I I I I I II I I II I I I I I I I M I I I I II I I I I I I I I I I I I I I I I I I | | | | | | | M U | | 
17 4 3 ccacagcccaggcctccagctcagcagtgactgtggccatggtccccaagacctctatat 1802 



Qy 1558 TTGCTCTTTTATTTTTATGTCTAAAATCCTGCTTAAAACTTTTCAATAAACAAGATCGTC 1617 

1 I I I I I I I I I I I I I I I I ! I I I I I M I I I I I I I I I I ! I I I I 

Db 1803 ttggtcttttatttttatgtctaaaatcctgcttaaaacttttcaataaacaagatcgtc 1862 

Qy 1618 AGGA 1621 
I I I I 

Db 1863 agga 1866 



RESULT 1 

HSGPCRIN8 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

sou rce 



CDS 



interleukin-8 receptor. 



polyA_s 
polyA_s 

BASE COUNT 

ORIGIN 



HSGPCRIN8 1670 bp mRNA PRI 16-MAY-1997 

H. sapiens mRNA for G-protein coupled receptor. 
X95876 

X95876.1 GI:1552845 
G protein-coupled receptor; 
human . 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; 
Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1670) 

Loetscher , M . , Gerber,B., Loetsche r , P . , Jones, S. A., Piali,L., 
Clark-Lewis,!., Baggiolini , M . and Moser,B. 

Chernokine receptor specific for IP10 and mig: structure, function, 
and expression in activated T-lymphocytes 
J. Exp. Med. 184 (3), 963-969 (1996) 
97188912 

2 (bases 1 to 1670} 
Loetscher, M . 
Direct Submission 

Submitted ( 2 6-FEB-1996) M . Loetscher, Theodor-Kocher Institute, 
University of Bern, Freiestrasse 1, CH-3012 Bern, SWITZERLAND 

Location/Qualifiers 

1. .1670 

/organism="Homo sapiens" 
/db_xref=" taxon : 9606" 
/haplotype= "diploid" 
/cell_line="KT30" 
/cell__type="T lymphocytes CD4 + " 
/ clone="MLRA" 

/clone_lib="lambda ZAP Express" 
/dev_stage=" adult" 
/tissue_type=" blood" 
69. .1175 
/codon_start=l 

/product="G-protein coupled receptor" 
/protein_id="CAA6512 6. 1" 
/db_xref="GI : 1552846" 
/db_xref="SWISS-PROT:P4 9682" 

/ t ransla t ion= "MVLEVSDHQVLNDAEVAALLENFSSSYDYGENESDSCCTSPPCP 
QDFSLNFDRAFLPALYSLLFLLGLLGNGAVAAVLLSRRTALSSTDTFLLHLAVADTLL 
VLTLPLWAVDAAVQWVFGSGLCKVAGALFNINFYAGALLLACISFDRYLNI VHATQLY 
RRGPPARVTLTCLAVWGLCLLFALPDFIFLSAHHDERLNATHCQYNFPQVGRTALRVL 
QLVAGFLLPLLVMAYCYAHILAVLLVSRGQRRLRAMRLVVVVVVAFALCWTPYHLVVL 
VDILMDLGALARNCGRESRVDVAKSVTSGLGYMHCCLNPLLYAFVGVKFRERMWMLLL 
RLGCPNQRGLQRQPSSSRRDSSWSETSEASYSGL" 

ignal 1602. .1607 

ite 1623 

319 a 574 c 436 g 341 t 



Query Match 97.2%; Score 1623; DB 66; Length 1670; 

Best Local Similarity 100.0%; Pred. No. l.le-255; 

Matches 1623; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 CCAACCACAAGCACCAAAGCAGAGGGGCAGGCAGCACACCACCCAGCAGCCAGAGCACCA 60 

I I I I I I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I 
Db 1 CCAACCACAAGCACCAAAGCAGAGGGGCAGGCAGCACACCACCCAGCAGCCAGAGCACCA 60 

Qy 61 GCCCAGCCATGGTCCTTGAGGTGAGTGACCACCAAGTGCTAAATGACGCCGAGGTTGCCG 120 

I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I 
Db 61 GCCCAGCCATGGTCCTTGAGGTGAGTGACCACCAAGTGCTAAATGACGCCGAGGTTGCCG 120 

Qy 121 CCCTCCTGGAGAACTTCAGCTCTTCCTATGACTATGGAGAAAACGAGAGTGACTCGTGCT 180 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I f 
Db 121 CCCTCCTGGAGAACTTCAGCTCTTCCTATGACTATGGAGAAAACGAGAGTGACTCGTGCT 180 



Qy 181 GTACCTCCCCGCCCTGCCCACAGGACTTCAGCCTGAACTTCGACCGGGCCTTCCTGCCAG 24 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 



181 GTACCTCCCCGCCCTGCCCACAGGACTTCAGCCTGAACTTCGACCGGGCCTTCCTGCCAG 240 



Qy 241 CCCTCTACAGCCTCCTCTTTCTGCTGGGGCTGCTGGGCAACGGCGCGGTGGCAGCCGTGC 300 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I ! I I t I I I I I M I I I I i I I I I I I I II I 
Db 241 CCCTCTACAGCCTCCTCTTTCTGCTGGGGCTGCTGGGCAACGGCGCGGTGGCAGCCGTGC 300 

Qy 301 TGCTGAGCCGGCGGACAGCCCTGAGCAGCACCGACACCTTCCTGCTCCACCTAGCTGTAG 360 

I I I I M I I M I I I I I I I I I t I II I II I I I I I I I I I I I I I ) I I t | | | j | I | | | M | | | | | | 
Db 301 TGCTGAGCCGGCGGACAGCCCTGAGCAGCACCGACACCTTCCTGCTCCACCTAGCTGTAG 360 

Qy 3 61 CAGACACGCTGCTGGTGCTGACACTGCCGCTCTGGGCAGTGGACGCTGCCGTCCAGTGGG 420 

I M I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I t I 

Db 361 CAGACACGCTGCTGGTGCTGACACTGCCGCTCTGGGCAGTGGACGCTGCCGTCCAGTGGG 420 

Qy 421 TCTTTGGCTCTGGCCTCTGCAAAGTGGCAGGTGCCCTCTTCAACATCAACTTCTACGCAG 4 80 

I I M I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I [ I I I I | | 
Db 421 TCTTTGGCTCTGGCCTCTGCAAAGTGGCAGGTGCCCTCTTCAACATCAACTTCTACGCAG 4 80 

Qy 481 GAGCCCTCCTGCTGGCCTGCATCAGCTTTGACCGCTACCTGAACATAGTTCATGCCACCC 540 

I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I II I! If I I I II I i I I I I I I I M I I I | 
Db 481 GAGCCCTCCTGCTGGCCTGCATCAGCTTTGACCGCTACCTGAACATAGTTCATGCCACCC 54 0 

Qy 541 AGCTCTACCGCCGGGGGCCCCCGGCCCGCGTGACCCTCACCTGCCTGGCTGTCTGGGGGC 600 

I I I I I I I I I I I I I I I I I I I I I M M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | 
Db 541 AGCTCTACCGCCGGGGGCCCCCGGCCCGCGTGACCCTCACCTGCCTGGCTGTCTGGGGGC 600 

Qy 601 TCTGCCTGCTTTTCGCCCTCCCAGACTTCATCTTCCTGTCGGCCCACCACGACGAGCGCC 660 

I I I I I M I I I I I M I I I I I I I I I I I I i I I I I M I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 601 TCTGCCTGCTTTTCGCCCTCCCAGACTTCATCTTCCTGTCGGCCCACCACGACGAGCGCC 660 

Qy 661 TCAACGCCACCCACTGCCAATACAACTTCCCACAGGTGGGCCGCACGGCTCTGCGGGTGC 720 

I M I I I I I I II I I I I I I I I I I I I I I I II I I I I I M I I I I I I I I I I II I I II II I I I I I II 
Db 661 TCAACGCCACCCACTGCCAATACAACTTCCCACAGGTGGGCCGCACGGCTCTGCGGGTGC 72 0 

Qy 721 TGCAGCTGGTGGCTGGCTTTCTGCTGCCCCTGCTGGTCATGGCCTACTGCTATGCCCACA 7 80 

I I t I I I I I I I I I I I I I II I I I I I I II I I II I I M II I I ( 1 I I II I I I I I I I I I I II I I M 
Db 721 TGCAGCTGGTGGCTGGCTTTCTGCTGCCCCTGCTGGTCATGGCCTACTGCTATGCCCACA 7 80 

Qy 781 TCCTGGCCGTGCTGCTGGTTTCCAGGGGCCAGCGGCGCCTGCGGGCCATGCGGCTGGTGG 84 0 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I 
Db 781 TCCTGGCCGTGCTGCTGGTTTCCAGGGGCCAGCGGCGCCTGCGGGCCATGCGGCTGGTGG 84 0 

Qy 841 TGGTGGTCGTGGTGGCCTTTGCCCTCTGCTGGACCCCCTATCACCTGGTGGTGCTGGTGG 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 TGGTGGTCGTGGTGGCCTTTGCCCTCTGCTGGACCCCCTATCACCTGGTGGTGCTGGTGG 900 

Qy 901 ACATCCTCATGGACCTGGGCGCTTTGGCCCGCAACTGTGGCCGAGAAAGCAGGGTAGACG 960 

I I I I I I t I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I II 
Db 901 ACATCCTCATGGACCTGGGCGCTTTGGCCCGCAACTGTGGCCGAGAAAGCAGGGTAGACG 9 60 

Qy 961 TGGCCAAGTCGGTCACCTCAGGCCTGGGCTACATGCACTGCTGCCTCAACCCGCTGCTCT 102 0 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 9 61 TGGCCAAGTCGGTCACCTCAGGCCTGGGCTACATGCACTGCTGCCTCAACCCGCTGCTCT 1020 

Qy 1021 ATGCCTTTGTAGGGGTCAAGTTCCGGGAGCGGATGTGGATGCTGCTCTTGCGCCTGGGCT 1080 

II I I I I I I I I I I I I I I I I I I M II I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1021 ATGCCTTTGTAGGGGTCAAGTTCCGGGAGCGGATGTGGATGCTGCTCTTGCGCCTGGGCT 1080 

Qy 1081 GCCCCAACCAGAGAGGGCTCCAGAGGCAGCCATCGTCTTCCCGCCGGGATTCATCCTGGT 1140 

I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 1081 GCCCCAACCAGAGAGGGCTCCAGAGGCAGCCATCGTCTTCCCGCCGGGATTCATCCTGGT 114 0 

Qy 1141 CTGAGACCTCAGAGGCCTCCTACTCGGGCTTGTGAGGCCGGAATCCGGGCTCCCCTTTCG 1200 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I II I I I I I I I I I I I I I I M I I I I I I I I I I 
Db 1141 CTGAGACCTCAGAGGCCTCCTACTCGGGCTTGTGAGGCCGGAATCCGGGCTCCCCTTTCG 1200 

Qy 1201 CCCACAGTCTGACTTCCCCGCATTCCAGGCTCCTCCCTCCCTCTGCCGGCTCTGGCTCTC 12 60 

I I I I I I I I M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 12 01 CCCACAGTCTGACTTCCCCGCATTCCAGGCTCCTCCCTCCCTCTGCCGGCTCTGGCTCTC 12 60 

Qy 1261 CCCAATATCCTCGCTCCCGGGACTCACTGGCAGCCCCAGCACCACCAGGTCTCCCGGGAA 1320 



1 M I I I I i i I I I I I I I I I | | | | | ( ) | | | | | | | | | | | | j | | | | | i | 

Db 1261 CCCAATATCCTCGCTCCCGGGACTCACTGGCAGCCCCAGCACCACCAGGTCTCCCGGGAA 1320 

Qy 1321 GCCACCCTCCCAGCTCTGAGGACTGCACCATTGCTGCTCCTTAGCTGCCAAGCCCCATCC 1380 

I I I I I II I I I I I I I I I M I I I I I t I I I I I I I I I I I I | | | | | | | 

Db 1321 GCCACCCTCCCAGCTCTGAGGACTGCACCATTGCTGCTCCTTAGCTGCCAAGCCCCATCC 1380 

Qy 13 81 TGCCGCCCGAGGTGGCTGCCTGGAGCCCCACTGCCCTTCTCATTTGGAAACTAAAACTTC 1440 

I I M I M I I I I I I I I I I I I I I II I I I I I I I I I I I ! I I I I I I I I I I I I I I I I | | | | | | | | | 
Db 1381 TGCCGCCCGAGGTGGCTGCCTGGAGCCCCACTGCCCTTCTCATTTGGAAACTAAAACTTC 1440 

Qy 14 41 ATCTTCCCCAAGTGCGGGGAGTACAAGGCATGGCGTAGAGGGTGCTGCCCCATGAAGCCA 1500 

I I I I I M I I I I I I I I M II I I I II I I ! I I II I I I I I I I M I I I I | | | | | | | | | | | | | | | | 
Db 1441 ATCTTCCCCAAGTGCGGGGAGTACAAGGCATGGCGTAGAGGGTGCTGCCCCATGAAGCCA 1500 

Qy 1501 CAGCCCAGGCCTCCAGCTCAGCAGTGACTGTGGCCATGGTCCCCAAGACCTCTATATTTG 1560 

M I I I II I I I I I I I I I I I I I I I I M | M | | | | | I | I | | | | | | | | | | | | | | | | | | | | | | | | 
Db 1501 CAGCCCAGGCCTCCAGCTCAGCAGTGACTGTGGCCATGGTCCCCAAGACCTCTATATTTG 1560 

Qy 1561 CTCTTTTATTTTTATGTCTAAAATCCTGCTTAAAACTTTTCAATAAACAAGATCGTCAGG 1620 

I I M I I I I I I I t I 1 I I I I II I I I I I I I I I I I I I I I I I I | I | | | | M | | | | | M I I I I I I I 
Db 1561 CTCTTTTATTTTTATGTCTAAAATCCTGCTTAAAACTTTTCAATAAACAAGATCGTCAGG 1620 

Qy 1621 ACC 1623 
I I I 

Db 1621 ACC 1623 



RESULT 2 
HSXDP_4 /c 
WPCOMMENT 

Sequence split into 

Fragment Name 

HSXDP_0 

HSXDP_1 

HSXDP_2 

HSXDP_3 

HSXDP_4 

HSXDP_5 
Continuation (5 of 6) 
cosmic! cosE3 map Xql3. 



6 fragments LOCUS HSXDP Accession AJ239319 
Begin End 

1 110000 
100001 210000 
200001 310000 
300001 410000 
400001 510000 
500001 547722 

of HSXDP from base 400001 (AJ239319 Homo sapiens chromosome X clone 
1, *** SEQUENCING IN PROGRESS 4 ordered pieces. 5/2000) 



Query Match 92.5%; Score 1545.2; DB 90; Length 110000; 

Best Local Similarity 99.2%; Pred. No. 3e-243; 

Matches 1553; Conservative 0; Mismatches 13; Indels 0; Gaps 0; 

Qy 58 CCAGCCCAGCCATGGTCCTTGAGGTGAGTGACCACCAAGTGCTAAATGACGCCGAGGTTG 117 

II I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I M I I I I 

Db 38061 CCTCCCCGTTCCCGCCCTCACAGGTGAGTGACCACCAAGTGCTAAATGACGCCGAGGTTG 38002 

Qy 118 CCGCCCTCCTGGAGAACTTCAGCTCTTCCTATGACTATGGAGAAAACGAGAGTGACTCGT 177 

I I I I I I 1 I I II I I I I I I I I I I I II I I I I I I I I I I M I I I I II I I I I I I I I I I I I I I I I I I 
Db 38001 CCGCCCTCCTGGAGAACTTCAGCTCTTCCTATGACTATGGAGAAAACGAGAGTGACTCGT 37 942 

Qy 178 GCTGTACCTCCCCGCCCTGCCCACAGGACTTCAGCCTGAACTTCGACCGGGCCTTCCTGC 2 37 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I M I I II I I I I I I I I I 
Db 37941 GCTGTACCTCCCCGCCCTGCCCACAGGACTTCAGCCTGAACTTCGACCGGGCCTTCCTGC 37882 

Qy 2 38 CAGCCCTCTACAGCCTCCTCTTTCTGCTGGGGCTGCTGGGCAACGGCGCGGTGGCAGCCG 297 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 37881 CAGCCCTCTACAGCCTCCTCTTTCTGCTGGGGCTGCTGGGCAACGGCGCGGTGGCAGCCG 37 822 

Qy 2 98 TGCTGCTGAGCCGGCGGACAGCCCTGAGCAGCACCGACACCTTCCTGCTCCACCTAGCTG 357 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 37 821 TGCTGCTGAGCCGGCGGACAGCCCTGAGCAGCACCGACACCTTCCTGCTCCACCTAGCTG 37 7 62 

Qy 358 TAGCAGACACGCTGCTGGTGCTGACACTGCCGCTCTGGGCAGTGGACGCTGCCGTCCAGT 417 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 37 7 61 TAGCAGACACGCTGCTGGTGCTGACACTGCCGCTCTGGGCAGTGGACGCTGCCGTCCAGT 377 02 



Qy 418 GGGTCTTTGGCTCTGGCCTCTGCAAAGTGGCAGGTGCCCTCTTCAACATCAACTTCTACG 477 

I I I I I I M I I t I I I ! I I I ! I : I I I I I I | | M I I I ! I I I I I I I | | I I [| I I I I I I I I I 
Db 37701 GGGTCTTTGGCTCTGGCCTCTGCAAAGTGGCAGGTGCCCTCTTCAACATCAACTTCTACG 37642 

Qy 478 CAGGAGCCCTCCTGCTGGCCTGCATCAGCTTTGACCGCTACCTGAACATAGTTCATGCCA 537 

I I I I I I I I I f I I I I I I I I t I I I I I II I I ! I I I I I I I | I I | M | | | | I ! | | | | M | | | | M 
Db 37641 CAGGAGCCCTCCTGCTGGCCTGCATCAGCTTTGACCGCTACCTGAACATAGTTCATGCCA 37582 

Qy 538 CCCAGCTCTACCGCCGGGGGCCCCCGGCCCGCGTGACCCTCACCTGCCTGGCTGTCTGGG 597 

I I I I I I I I I M I I I I I I I t I I I I I ! I I i II I I I I I I I I I I I I I I I I I I I I I I | I | | | | M 
Db 37581 CCCAGCTCTACCGCCGGGGGCCCCCGGCCCGCGTGACCCTCACCTGCCTGGCTGTCTGGG 37522 

Qy 598 GGCTCTGCCTGCTTTTCGCCCTCCCAGACTTCATCTTCCTGTCGGCCCACCACGACGAGC 657 

I I I I I I I I I I I I I I I I I I I I ! I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I | If 
Db 37521 GGCTCTGCCTGCTTTTCGCCCTCCCAGACTTCATCTTCCTGTCGGCCCACCACGACGAGC 37462 

Qy 658 GCCTCAACGCCACCCACTGCCAATACAACTTCCCACAGGTGGGCCGCACGGCTCTGCGGG 717 

I M I I I I I I I I I I I I I I I I I I I M II I I I I I I I I I I I I I II I I I I I I I I I I I I I I M | || 
Db 37461 GCCTCAACGCCACCCACTGCCAATACAACTTCCCACAGGTGGGCCGCACGGCTCTGCGGG 37402 

Qy 718 TGCTGCAGCTGGTGGCTGGCTTTCTGCTGCCCCTGCTGGTCATGGCCTACTGCTATGCCC 777 

I I I I I I I II I I I II I I I I I I I I I I II I I I I I I I I I II II I I I I I I I I I I I I I I | | | | | | | 
Db 37401 TGCTGCAGCTGGTGGCTGGCTTTCTGCTGCCCCTGCTGGTCATGGCCTACTGCTATGCCC 37342 

Qy 778 ACATCCTGGCCGTGCTGCTGGTTTCCAGGGGCCAGCGGCGCCTGCGGGCCATGCGGCTGG 837 

I I I I M I I II I I II I I I I I I I I I ! I I I M I I I | | | | | | | | | | | | | M | | | | | | | | | | | | | 
Db 37341 ACATCCTGGCCGTGCTGCTGGTTTCCAGGGGCCAGCGGCGCCTGCGGGCCATGCGGCTGG 37282 

Qy 838 TGGTGGTGGTCGTGGTGGCCTTTGCCCTCTGCTGGACCCCCTATCACCTGGTGGTGCTGG 897 

I M I I I I I I ! I I I I I I I I I I I I I M I I I I I I I II I I I I M I I I I I I I I I I I I I 1 I I | || | 
Db 37281 TGGTGGTGGTCGTGGTGGCCTTTGCCCTCTGCTGGACCCCCTATCACCTGGTGGTGCTGG 37222 

Qy 898 TGGACATCCTCATGGACCTGGGCGCTTTGGCCCGCAACTGTGGCCGAGAAAGCAGGGTAG 957 

I I I I I I I I I ! I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I M I I | 
Db 37221 TGGACATCCTCATGGACCTGGGCGCTTTGGCCCGCAACTGTGGCCGAGAAAGCAGGGTAG 37162 

Qy 958 ACGTGGCCAAGTCGGTCACCTCAGGCCTGGGCTACATGCACTGCTGCCTCAACCCGCTGC 1017 

M I I I M I I I I I I I I I I I I M I I I II I I II I I I I I I II I I I I I I I I I I I I I I I M I I I I I 
Db 37161 ACGTGGCCAAGTCGGTCACCTCAGGCCTGGGCTACATGCACTGCTGCCTCAACCCGCTGC 37102 

Qy 1018 TCTATGCCTTTGTAGGGGTCAAGTTCCGGGAGCGGATGTGGATGCTGCTCTTGCGCCTGG 1077 

I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 37101 TCTATGCCTTTGTAGGGGTCAAGTTCCGGGAGCGGATGTGGATGCTGCTCTTGCGCCTGG 37 042 

Qy 107 8 GCTGCCCCAACCAGAGAGGGCTCCAGAGGCAGCCATCGTCTTCCCGCCGGGATTCATCCT 1137 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I | | | I | | | 
Db 370 41 GCTGCCCCAACCAGAGAGGGCTCCAGAGGCAGCCATCGTCTTCCCGCCGGGATTCATCCT 3 6982 

Qy 1138 GGTCTGAGACCTCAGAGGCCTCCTACTCGGGCTTGTGAGGCCGGAATCCGGGCTCCCCTT 1197 

I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I | I | | | | | | | | | | | | | | | | | | | | | | 
Db 36981 GGTCTGAGACCTCAGAGGCCTCCTACTCGGGCTTGTGAGGCCGGAATCCGGGCTCCCCTT 36922 

Qy 1198 TCGCCCACAGTCTGACTTCCCCGCATTCCAGGCTCCTCCCTCCCTCTGCCGGCTCTGGCT 1257 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 36921 TCGCCCACAGTCTGACTTCCCCGCATTCCAGGCTCCTCCCTCCCTCTGCCGGCTCTGGCT 368 62 

Qy 12 58 CTCCCCAATATCCTCGCTCCCGGGACTCACTGGCAGCCCCAGCACCACCAGGTCTCCCGG 1317 

I I I I I I I I I 1 I I I I I I I II I I I I I II I I I I I I I II I I II II I I I I I I I I I I I I I I I I I I I 
Db 368 61 CTCCCCAATATCCTCGCTCCCGGGACTCACTGGCAGCCCCAGCACCACCAGGTCTCCCGG 3 6802 

Qy 1318 GAAGCCACCCTCCCAGCTCTGAGGACTGCACCATTGCTGCTCCTTAGCTGCCAAGCCCCA 1377 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 36801 GAAGCCACCCTCCCAGCTCTGAGGACTGCACCATTGCTGCTCCTTAGCTGCCAAGCCCCA 36742 

Qy 137 8 TCCTGCCGCCCGAGGTGGCTGCCTGGAGCCCCACTGCCCTTCTCATTTGGAAACTAAAAC 14 37 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 36741 TCCTGCCGCCCGAGGTGGCTGCCTGGAGCCCCACTGCCCTTCTCATTTGGAAACTAAAAC 36682 



Qy 14 38 TTCATCTTCCCCAAGTGCGGGGAGTACAAGGCATGGCGTAGAGGGTGCTGCCCCATGAAG 14 97 

M I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 36681 TTCATCTTCCCCAAGTGCGGGGAGTACAAGGCATGGCGTAGAGGGTGCTGCCCCATGAAG 36622 



RESULT 1 
W54371 

ID W54371 standard; Protein; 368 AA. 

XX 

AC W54371; 

XX 

DT 14-AUG-1998 (first entry) 

DE Human IP-10/Mig receptor CXCR3 protein, 
y v 

KW chemokine receptor; cellular signal; treatment; T cell; anti tumour; 
KW antiviral; inflammatory disease. 
XX 

OS Homo sapiens . 
XX 

FH Key 

FT Modif ied-site 
FT 

FT Modified-site 
FT 

FT Domain 
FT 

FT Domain 
FT 

FT Domain 
FT 

FT Domain 
FT 

FT Modified-site 
FT 

FT Domain 
FT 

FT Domain 
FT 

FT Domain 
FT 

XX 

PN W09811218-A1 . 

XX 

PD 19-MAR-1998. 

XX 

PF 10-SEP-1997; 97WO-US15915 . 

XX 

PR 31-MAR-1997; 97US-082 9839 . 
PR 10-SEP-1996; 96US-0709838 . 

XX 

PA (KOCH-) KOCHER INST THEODOR. 

PA (LEUK-) LEUKOSITE INC. 

XX 

PI Loetscher M, Mackay CR, Moser B, Qin S; 

XX 

DR WPI; 1998-207381/18. 
DR N-PSDB; V26557. 

XX 

PT DNA encoding CXC chemokine receptor 3 - inhibitors and promoters of 
PT which, are useful for treatment of inflammation or in anti-tumour or 
PT anti-viral therapy 

XX 

PS Claim 7; Fig 2; 137pp; English. 

XX 

CC The mammalian CXC chemokine receptor 3 (CXCR3) protein can selectively 
CC bind one or more chemokines and can mediate cellular signalling and/or a 
CC cellular response in response. Inhibitors and promoters of mammalian 
CC CXCR3 can be detected and identified using host cells expressing CXCR3 . 
CC CXCR3 inhibitors can be used for treatment of inflammatory diseases which 
CC are T cell mediated. CXCR3 promoters are useful for antitumour or 
CC antiviral therapy. 
XX 

SQ Sequence 368 AA; 



Location/Quali f iers 
22 

/note* "N-linked glycosylation site" 
32 

/note= "N-linked glycosylation site"" 
59. .79 

/note= "TM1 transmembrane domain" 
91 . , 111 

/note= "TM2 transmembrane domain" 
127 . . 147 

/note= "TM3 transmembrane domain" 
170. . 190 

/note= "TM4 transmembrane domain" 
199 

/note= "N-linked glycosylation site" 
224 . .244 

/note= "TM5 transmembrane domain" 
257 . .277 

/note= "TM6 transmembrane domain" 
302 . . 322 

/note- "TM7 transmembrane domain" 



Query Match 100.0%; Score 1907; DB 19; Length 368; 

Best Local Similarity 100.0%; Pred. No. 1.6e-217; 

Matches 368; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MVLEVSDHQVLNDAEVAALLENFSSSYDYGENESDSCCTSPPCPQDFSLNFDRAFLPALY 60 

M j I I I I I I I I I 1 I I I I I I I I I t I I I I I I ! I 1! I I I I I | I | | | | | I | | M I I I I I I I I ! I 
Db 1 mvlevsdhqvlndaevaallenfsssydygenesdscctsppcpqdfslnfdraflpaly 60 

Qy 61 SLLFLLGLLGNGAVAAVLLSRRTALSSTDTFLLHLAVADTLLVLTLPLWAVDAAVQWVFG 120 

M I I I I I I I 1 I ! I I I I ! I I I I I I I I I M I M I I I I I 1 I I I I I I I I M ! I I I I I I I I I I M 
Db 61 sllf Ilgllgngavaavllsrrtalsstdtf llhlavadtllvl tlplwavdaavqwvf g 120 

Qy 121 SGLCKVAGALFNINFYAGALLLAC1 SFDRYLNI VHATQLYRRGPPARVTLTCLAVWGLCL 180 

t I t I I I I I I I I I I I I I I I I I I I I I I I t II I I I I I I I I I I I I t I I I I I I M I I I I I I I I I t 
Db 121 sglckvagalfninf yagalllacis f drylnivhatqlyrrgpparvtl tclavwglcl 180 

Qy 181 LFALPDFI FLSAHHDERLNATHCQYNFPQVGRTALRVLQLVAGFLLPLLVMAYCYAHILA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I If I I I I I I I I I I I I I I | | M 

Db 181 If alpdfiflsahhderlnathcqynfpqvgrtalrvlqlvagfllpllvmaycyahila 240 

Qy 241 VLLVSRGQRRLRAMRLVVVVVVAFALCWTPYHLVVLVDILMDLGALARNCGRESRVDVAK 300 

M I I I I I I I i I I I 1 I I I M II I I I I I I I I I I II I I II II I I I I I I I I II I I I || | | | || | 
Db 241 vllvsrgqrrlramrlvvvvvvafalcwtpyhlvvlvdilmdlgalarncgresrvdvak 300 

Qy 301 SVTSGLGYMHCCLNPLLYAFVGVKFRERMWMLLLRLGCPNQRGLQRQPSSSRRDSSWSET 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I i I I I I I I I | | 
Db 301 svtsglgymhcclnpllyafvgvkf rermwmlllrlgcpnqrglqrqpsssrrdsswset 360 

Qy 361 SEAS YSGL 368 

I I I I I I I I 
Db 361 seasysgl 368 



RESULT 6 
R27793 

ID R27793 standard; Protein; 372 AA. 

XX 

AC R27793; 
XX 

DT 12-MAR-1993 (first entry) 
XX 

DE New platelet factor 4 receptor superfamily member PF4 ARii . 

XX 

KW IL-8R; G-protein coupled receptor family; rhodopsin superfamily; 

KW pro-inflammatory cytokine; 8rr.9. 

XX 

OS Homo sapiens . 
XX 

PN W09217497-A. 
XX 

PD 15-OCT-1992. 
XX 

PF 23-MAR-1992; 92WO-US02317 . 

XX 

PR 29-MAR-1991; 91US-06772 1 1 . 
PR 19-DEC-1991; 91US-08 1 0782 . 
XX 

PA { GETH ) GENENTECH INC. 

XX 

PI Holmes WE, Lee J, Wood WI ; 
XX 

DR WPI; 1992-366191/44. 
DR N-PSDB; Q37107. 

XX 

PT Isolated human platelet factor 4 super-family receptor 

PT polypeptide and corresp. antibodies and DNA - useful as 

PT diagnostic and screening agents, and for treating inflammation or 

PT PF4AR-mediated disorders 

XX 



PS Claim 7; Fig 5; 78pp; English. 

XX 

CC The IL-8 receptor cDNA sequence was isolated (see Q29505) and a 

CC 874bp sub-fragment of the coding sequence was used as a probe to 

CC screen human cell line HL60 and human peripheral blood lymphocyte 

CC cDNA libraries. Two new gene sequences were found that are clearly 

CC related to the IL-8 receptor. One of these was contained in clone 

CC 8rr.9 and is predicted to encode an amino acid sequence which is 

CC 36r and 38% identical with the high and low affinity IL-8 receptor 

CC sequences, respectively. See also Q37107. 
XX 

SQ Sequence 372 AA; 



Query Match 33.3%; Score 635; DB 13; Length 372; 

Best Local Similarity 40.8%; Pred. No. le-66; 

Matches 147; Conservative 50; Mismatches 145; Indels 18; Gaps 8; 



Qy 


20 


LENFSSSY DYGENESD-SCCTSPPCPQ DFSLNFDRAFLPALYSLLFLLGLLGNG 

Ml : 1 : 1 : 1 1 : II : I | : | | | | : | | | | : : | | 
lenledl f weldr ldnyndt si venhlcpa tegplmas f kavf vpvaysli f llgvignv 


72 


Db 


il. 


70 


Qy 


73 


AVAAVLLSRRTALSSTDTFLLHLAVADTLLVLTLPLWAVDAAVQWVFGSGLCKVAGALFN 

1 : I 1 I I : | | ! | | | | | | | | | || : : 1 1 1 i : I I I I | 
lvlvilerhrqtrsstetf If hlavadlllvf ilpf avaegsvgwvlgt f lcktvialhk 


132 


Db 


71 


130 


Qy 


133 


INFYAGALLLACISFDRYLNI VHATQLYRRGPPARVTLTCLAVWGLCLLFALPDFI FL-- 
: i M : 1 1 1 1 1 1 : II 1 1 1 1 1 1 II : : | | : 1 : | III: : | 
vnf ycsslllaciavdrylaivhavhayrhrrllsihitcgtiwlvgf llalpeilf akv 


190 


Db 


131 


190 


Qy 


191 


-SAHHDERLNATHCQYNFPQVGRT ALRVLQL VAGFLLPLLVMAYCYAHILAVL-LV 

II: 1 1 : : 1 II 1 1 i 1 1 1 1 : 1 1 1 : 1 1 : : 1 
sqghhnnsl--prct f sqenqaethawf tsrf lyhvagf 1 lpmlvmgwcyvgvvh rl rqa 


244 


Db 


191 


248 


Qy 


245 


SRGQRRLRAMRLVVVVVVAFALCWTP YHLVVLVDI LMDLGALARNCGRESRVDVAKSVTS 

1 : i : 1 : 1 : : : 1 I I I j : | I | : | : : | | | ( : | : | | : : 
qrrpqrqkavrvailvtsif f lcwspyhivi f ldt larl kavdntcklngslpvai tmce 


304 


Db 


249 


308 


Qy 


305 


GLGYMHCCLNPLLYAFVGVKFRERMWMLLLRLGCPNQRGLQRQPSSSRRDSSWSETSEAS 

II 111111:111 : 11:111 I : 1 1 1 1 1 1 1 : | : 

flglahcclnpmlytfagvkf rsdls rll t klgctgpaslcql f pswrr-sslsesenat 


364 


Db 


309 


367 



RESULT 3 
S42628 

G protein-coupled receptor Gpcr6 - mouse 

N; Alternate names: interleukin-8 receptor homolog; muBLRl protein 
C; Species: Mus musculus (house mouse) 

C;Date: 07-Sep-1994 #sequence_revision 26-May-1995 #text_change 19-May-2000 
C;Accession: S42628; C48909 

R; Kaiser, E . ; Foerster, R.; Wolf, I.; Ebensperger, C; Kuehl, W.M.; Lipp, M. 
Eur. J. Immunol. 23, 2532-2539, 1993 

A;Title: The G protein-coupled receptor BLR1 is involved in murine B cell differentiation 
and is also expressed in neuronal tissues. 
A;Reference number: S42628; MUI D : 94 0092 1 1 
A;Accession: S42628 

A; Status: preliminary; nucleic acid sequence not shown 
A; Molecule type: DNA 
A; Residues: 1-374 <KAI> 

A;Cross-references : EMBL:X71788; NID:g433946; PIDN : CAA50 673 . 1 ; PID:g433947 
R;Wilkie, T.M.; Chen, Y.; Gilbert, D.J.; Moore, K.J.; Yu, L.; Simon, M.I.; Copeland, 
N.G.; Jenkins, N.A. 
Genomics 18, 175-184, 1993 

A;Title: Identification, chromosomal location, and genome organization of mammalian G- 
protein-coupled receptors. 

A;Reference number: A48909; MUI D : 94 1 16980 
A;Accession: C48909 

A; Status: preliminary; nucleic acid sequence not shown 



A;Molecule type: mRNA 

A; Residues: 151-269 <WIL> 

A/Cross-references: GB:L20332; NID:g438798; PIDN : AAA16852 . 1; PID:g438799 
C; Superf amily : vertebrate rhodopsin 

C;Keywords: G protein-coupled receptor; transmembrane protein 

Query Match 33.6%; Score 640; DB 2; Length 374; 

Best Local Similarity 40.9%; Pred. No. 3.5e-50; 

Matches 143; Conservative 54; Mismatches 139; Indels 14; Gaps 6; 

Qy 23 FSSSYDYGENESDSCCTSPPCPQDFSLNFDRAFLPALYSLLFLLGLLGNGAVAAVLLSRR 82 

: I = 1 = : I : I I : I I : I I I I : ! ( I I : : I I I : | | 

Db 26 YSNSTEI PLQDSNFCST VEGPLLTSFKAVFMPVAYSLI FLLGMMGNI LVLVILERHR 82 

Qy 83 TALSSTDTFLLHLAVADTLLVLTLPLWAVDAAVQWVFGSGLCKVAGALFNINFYAGALLL 142 

I I f : I ! I I I I I I I III || : : I I I I : I I I | MM : I I : 

Db 83 HTRSSTETFLFHLAVADLLLVFILPFAVAEGSVGWVLGTFLCKTVIALHKINFYCSSLLV 142 

Qy 143 ACIS FDRYLN I VHATQLYRRGPPARVTLTCLAVWGLCLLFALPDFI FLSA HHDERLN 199 

111:11111111 III : : | | | : | I I I I I :: I I : : : I 

Db 143 ACIAVDRY LAI VHAVHAYRRRRLLSI HITCTAIWLAGFLFAL PELL FAKVGQPHNN DSL- 201 

Qy 200 ATHCQYNFPQVGRT ALRVLQLVAGFLLPLLVMAYC YAHI LAVLL-VSRGQRRLRAM 254 

I : : I II : I I I I I : I I I : I I : : II I : I : I : 

Db 202 -PQCTFSQENEAETRAWFTSRFLYHIGGFLLPMLVMGWCYVGVVHRLLQAQRRPQRQKAV 260 

Qy 255 RLVVVVVVAFALCWTPYHLVVLVDILMDLGALARNCGRESRVDVAKSVTSGLGYMHCCLN 314 

I : : : I I I I I : I I I : I : : I I I I : : I : I I : : I 

Db 261 RVAILVTSI FFLCWSPYHI VI FLDTLERLKAVNSSCELSGYLSVAITLCEFLGLAHCCLN 320 

Qy 315 PLLYAFVGVKFRERMWMLLLRLGCPNQRGLQRQPSSSRRDSSWSETSEAS 364 

I : I I I I I I I I ■ I I : M I I I : I I I I I : I : 

Db 321 PMLYTFAGVKFRSDLSRLLTKLGCAGPASL-CQLFPNWRKSSLSESENAT 369 



RESULT 5 
A53611 

interleukin-8 receptor type B - human 
C; Species: Homo sapiens (man) 

C;Date: 07-Oct-1994 # sequence_revision 12-Apr-1996 #text_change 05-Nov-1999 

C;Accession: 137898; 138712; A53611; A39446 

R;Ahuja, S.K.; Shetty, A.; Tiffany, H.L.; Murphy, P.M. 

J. Biol. Chem. 269, 26381-26389, 1994 

A;Title: Comparison of the genomic organization and promoter function for human 

interleukin-8 receptors A and B. 

A;Reference number: 137898; MUID : 95014 47 6 

A;Accession: 137898 

A; Status: preliminary 

A;Molecule type: DNA 

A; Residues: 1-360 <RES> 

A;Cross-references : EMBL:U11869; NID:g511801; PIDN : AAB60 65 6 . 1 ; PID:g511803 
A;Accession: 138712 
A;Status: preliminary 
A;Molecule type: mRNA 
A; Residues: 1-15 <RE2> 

A;Cross-references : EMBL:U11872; NID:g511808; PIDN: AAA 64380.1; PID:g511809; EMBL:U11873; 
NID:g511810; PID:g511811; EMBL:U11874; NID:g511812; PID:g511813; EMBL:U11875; 
NID:g511814; PID:g511815; EMBL:U11876; NID:g511816; PID:g511817; EMBL:U11877; 
NID:g511818; PID:g511819; EMBL:U11878; NID:g511820; PID:g511821 
R;Sprenger, H.; Lloyd, A.R.; Lautens, L.L.; Bonner,' T.I.; Kelvin, D.J. 
J. Biol. Chem. 269, 11065-11072, 1994 

A;Title: Structure, genomic organization, and expression of the human interleukin-8 
receptor B gene. 

A;Reference number: A53611; MUID : 9420927 3 
A;Accession: A53611 
A;Status: preliminary 
A;Molecule type: DNA 
A;Residues: 6-360 <SPR> 



A; Cross-references : GB:M99412; GB:L19593 
R;Murphy, P.M.; Tiffany, H.L. 
Science 253, 1280-1283, 1991 

A;Title: Cloning of complementary DNA encoding a functional human in t e rleukin- 8 receptor. 
A/Reference number: A39446; MUI D : 9 1 3 68200 
A/Accession: A39446 

A; Status: preliminary; nucleic acid seguence not shown 

A; Molecule type: mRNA 

A; Residues: 6-360 <MUR> 

A; Cross-re ferences : GB:M73969 

C;Comment: This receptor, unlike I L 8 R A , binds several peptides besides in t e rleu kin-8 , 
including GRO, NAP-2, and ENA-78. 
C; Genetics : 

A;Gene: GDB : IL8RB; IL8RA 

A;Cross-references : GDB:127868; OMIM:146928 

A; Map position: 2q35-2q35 

C ; Superf amily : vertebrate rhodopsin 

C; Keywords: G protein-coupled receptor; transmembrane protein 



Query Match 33.2%; Score 633; DB 2; Length 360; 

Best Local Similarity 40.1%; Pred. No. 1.4e-49; 

Matches 141; Conservative 51; Mismatches 130; Indels 30; Gaps 6; 

Qy 25 SSYDYGENESDSCCTSPPCPQDFSLNFDRAFLPALYSLLFLLGLLGNGAVAAVLLSRRTA 84 

1:1 I : I I : I I : : I : : I : I : I I I I I I I I I : I I 

Db 21 SNYS YSSTLPPFLLDAAPCEPE-SLEINKYFVVI I YALVFLLSLLGNSLVMLVI LYSRVG 79 

Qy 85 LSSTDTFLLHLAVADTLLVLTLPLWAVDAAVQWVFGSGLCKVAGALFNINFYAGALLLAC 144 

111:11:11:111 I I I I : I I I : I 1 : I I I I I : I I I : I ! I I I | 

Db 80 RSVTDVYLLNLALADLLFALTLPI WAASKVNGWI FGTFLCKVVSLLKEVNFYSGI LLLAC 139 

Qy 145 I S FDRYLNI VHATQLYRRGPPARVTLT CLAVWGLCLLFALPDFI FLSAHHD 195 

I I I I I I I I M i : III I I :: I I I I I I I I : I : 

Db 140 ISVDRYLAI VHATR TLTQKRYLVKFICLSIWGLSLLLALPVLLFRRTVYS 189 

Qy 196 ERLNATHCQYNFPQVG RTALRVLQLVAGFLLPLLVMAYCYAHILAVLLVSRGQR 24 9 

: = I : : I I I I : I I I : : I I I : I : I I I I : : 

Db 190 SNVSPA-C YEDMGNNTANWRMLLRILPQSFGFIVPLLIMLFCYGFTLRTLFKAHMGQ 245 

Qy 250 RLRAMRLVVVVVVAFALCWTPYHLVVLVDILMDLGALARNCGRESRVDVAKSVTSGLGYM 309 

: I I I I : : I I : I I I I I I : I! : I I I I : I I : : I I I I I : 

Db 24 6 KHRAMRVI FAVVLI FLLCWLPYNLVLLADTLMRTQVIQETCERRNH1 DRALDATEILGIL 305 

Qy 310 HCCLNPLLYAFVGVKFRERMWMLLLRLGCPNQRGLQRQPSSSRRDSSWSETS 361 

I I I I I i : I I I : I I I I : : I I : : I : I i I I 

Db 306 HSCLNPLI YAFIGQKFRHGLLKILAIHGLISKDSLPKDSRPSFVGSSSGHTS 357 



RESULT 1 
CCR3_HUMAN 

ID CCR3_HUMAN STANDARD; PRT; 3 68 AA. 

AC P49682; 

DT 01-FEB-1996 (Rel . 33, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update} 

DT 30-MAY-2000 {Rel. 39, Last annotation update) 

DE C-X-C CHEMOKINE RECEPTOR TYPE 3 (CXC-R3) (CXCR-3) . 

GN CXCR3 OR GPR9 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
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